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METHOD, ARTICLE OF MANUFACTURE AND 
APPARATUS FOR PERFORMING AUTOMATIC 
INTERMODULE CALL LINKAGE OPTIMIZATION 

BACKGROUND OF THE INVENTION 
Field of the Invention 

[0001] The invention relates to computers and computer software. More particularly, 
the invention relates to a method, article of manufacture and apparatus for performing 
automatic intermodule call linkage optimization. 

Description of the Related Art 

[0002] Computer systems run or execute software programs to implement a variety of 
functions. These software programs or computer programs are conventionally written 
in a high-level language, e.g., C++. In particular, C++ is an object-oriented 
programming language in which programs are created using abstractions and 
constructs to create user-defined classes for defining the methods and variables for a 
particular type of object. AH objects of a particular class are identical in form and 
behavior but contain different data on their variables. 

[0003] The text of a computer program written in such a high-level language is called 
a source code. However, to more efficiently run a computer program, the computer 
program is conventionally converted from source code to machine language. A 
compiler is a computer program that converts or, more particularly, compiles source 
code into machine language. A compiled version of the source code is called an 
object code. 

[0004] As computer systems execute a variety of software programs, there is a need 
to reduce the amount of time required to execute these programs. Many compilers 
currently incorporate some type of optimization to minimize the time for program 
execution, i.e., run-time optimization. In one type of compiler optimization, known as 
"inlining optimization," the compiler replaces a call or invocation of a procedure with the 
instructions of the called procedure. By replacing the invocation of procedures, the 
inlining optimization eliminates the overhead required to call the procedure. However, 
if the source code or program contains many procedure calls, then inlining optimization 
considerably increases the amount of object code in the program. 
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[0005] An automatic implementation of run-time optimization is possible, but is limited 
to procedure calls in the same module in the source code or program. However, such 
an intramodule optimization fails to completely optimize a program having multiple 
modules. 

[0006] An intermodule run-time optimization is implemented by storing certain 
parameters or variables in processor registers to eliminate the need to perform 
memory access during procedure calls or invocations. In this type of optimization, 
known as "argument optimization," a programmer must manually modify a program or 
source code to indicate which procedures within the code are to be optimized. 
However, manual modification of the source code is subject to human errors and is 
difficult to implement. Therefore, there is a need in the art to provide an automatic and 
intermodule compiler optimization. 

SUMMARY OF THE INVENTION 

[0007] The invention provides a method, apparatus and article of manufacture and 
apparatus for performing automatic intermodule call linkage optimization. In one 
embodiment, the run time is optimized for an object code generated from a source 
code. Initially, information is extracted for each procedure call in the source code. The 
extracted information is used to select a call linkage for each procedure call. The call 
linkages are selected to minimize the run time of the object code generated from the 
source code. Once the object code is generated form the source code, the object 
code is run using the selected call linkages for each procedure call. 
[0008] An apparatus comprising a memory and a processor is also provided. The 
memory stores compiler program. The processor comprises a plurality of processor 
registers. Some of these processor registers are configured as parameter registers. 
The processor performs a method upon executing the compiler program. Information 
is initially extracted for each procedure call in a source code. The extracted 
information is used to select a call linkage for each procedure call. The call linkages 
are selected to minimize the run time of an object code generated from the source 
code. The object code is then generated form the source code. 
[0009] Additionally, a computer readable medium storing a software program is 
provided. The software program, when executed by a computer, causes the computer 
to perform a method. Initially, information is extracted for each procedure call in the 
source code. The extracted information is used to select a call linkage for each 
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procedure call. The call linkages are selected to minimize the run time of the object 
code generated from the source code. Once the object code is generated form the 
source code, the object code is run using the selected call linkages for each procedure 
call. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0010] The teachings of the present invention can be readily understood by 
considering the following detailed description in conjunction with the accompanying 
drawings, in which: 

[0011] FIG. 1 depicts a block diagram of a computer system utilized to implement the 
present invention; 

[0012] FIG. 2 depicts a conversion of a source code to an object code by a compiler 
program; 

[0013] FIG. 3 depicts a representation of the source code of FIG. 2; 

[0014] FIG. 4A depicts one data structure used to select call linkages for each 

procedure call; 

[0015] FIG. 4B depicts another data structure used to select call linkages for each 
procedure call; 

[0016] FIG. 5 depicts a flow diagram of a method for optimizing the run time of a 
generated object code; 

[0017] FIG. 6 depicts a flow diagram of a method for creating one embodiment of the 
data structure; 

[0018] FIG. 7 depicts a flow diagram of a method for creating another embodiment of 
a data structure; 

[0019] FIG. 8 depicts a flow diagram of a method for determining the optimal call 
linkages for each computer call; 

[0020] FIG. 9A depicts an exemplary process to implement a procedure call using a 
memory-based style of call linkage; and 

[0021] FIG. 9B depicts an exemplary process to implement a procedure call using a 
register-based style of call linkage. 

[0022] To facilitate understanding, identical reference numerals have been used, 
where possible, to designate identical elements that are common to the figures. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

[0023] FIG. 1 depicts an illustrative computer system 100 utilized in accordance with 
the present invention. The computer system 100 may represent any type of computer, 
computer system or other programmable electronic device, including a client computer, 
a server computer, a portable computer, an embedded controller, and the like. The 
computer system 100 may be a standalone device or coupled to a computer network 
system. In one embodiment, the computer system 100 is an E-Server iSeries 400 or 
AS/400 available from International Business Machines of Armonk, New York. 
[0024] The computer system 100 is shown in a programming environment having at 
least one processor 102, which obtains instructions and data from a main memory 106 
via a bus 104. In one embodiment, the processor 102 may comprise a plurality of 
registers 128i, 128 2 , 128 N (hereinafter 128 N ) for limited storage of information. A 
particular subset of these registers 128n is used to physically implement procedure 
calls. This subset of registers 128 N is herein referred to as "parameter registers". 
[0025] The main memory 106 includes an operating system 108, a compiler program 
110 (hereinafter called "compiler"), and various application programs 112. Additionally, 
the memory 106 comprises source code 114, object code 116, and various data 
structures 117. The main memory 106 may comprise one or a combination of memory 
devices, including Random Access Memory, nonvolatile or backup memory, (e.g., 
programmable or Flash memories, read-only memories, and the like). In addition, 
memory 106 may include memory physically located elsewhere in a computer system 
100, for example, any storage capacity used as virtual memory or stored on a mass 
storage device or on another computer coupled to the computer system 100 via bus 
104. 

[0026] The computer system 100 is generally coupled to a number of peripheral 
devices. In one embodiment, the computer system 100 is illustratively coupled to a 
storage medium 118, input devices 120, and output devices 122. The storage medium 
1 18 is operably coupled to the computer system 100 via a storage interface 124. One 
example of the storage interface is a disk drive, e.g., floppy drive, optical drive, tape 
backup, and the like. The input devices 120 and output devices 122 are coupled to the 
computer system 100 via an input/output interface 126. 

[0027] The storage medium 1 1 8 may comprise either a permanent or removable 
direct access storage device (DASD). The input devices 120 may comprise any device 
utilized to provide input to the computer system 100. Examples of input devices 120 
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include a keyboard, a keypad, a light pen, a touch screen, a button, a mouse, a track 
ball, a speech recognition unit, and the like. The output devices 126 may comprise 
any conventional display screen. Although shown separately from the input devices 
120, the output devices 126 and input devices 120 could be combined. For example, a 
display screen with an integrated touch screen, and a display with an integrated 
keyboard, or a speech recognition unit combined with a text speech converter could be 
used. 

[0028] The operating system 108 is the software utilized to operate the computer 
system 100. Examples of the operating system 108 include IBM OS/400, UNIX, IBM 
AIX, Microsoft Windows, and the like. The compiler 1 10 is a software program that 
translates the source code 114 into the object code 116. More specifically, the 
compiler 110 analyzes the source code 114 and generates the data structures 117. 
The compiler 110 uses the data structures 1 17 to generate the object code 116. As 
such, the compiler 110 may be viewed as having a front-end and a back-end, in which 
the front end generates the data structures 117 from the source code 114 and the 
back-end generates the object code 116 from the data structures 117. 
[0029] The source code 114 comprises one or more programs or files written in a 
programming language or some other code that the compiler 110 may translate into 
the object code 116. Examples of programming languages include Fortran, Ada, 
Cobol, Modula-2, Pascal, Java, Visual Basic, C, C+, C++, and the like. The object 
code 116 comprises one or more files or programs used by the operating system 108 
or a particular application program 112. 

[0030] One important feature in the compiler arts is to optimize the generation of 
object code 1 16 from the source code 114. Such optimization is implemented as a 
"program optimizer" function in the compiler 110. There are different modes or ways to 
optimize the compiler program 110. For example, the optimization may minimize the 
object code 116 generated from the source code 1 1 4. One embodiment provided 
herein optimizes the compiler program 1 10 by improving the runtime performance, 
e.g., minimizing the amount of time to execute the object code 1 1 6 generated from the 
source code 114. To minimize this runtime performance, the compiler program 110 
selects a call linkage that is most efficient for each procedure call in the source code 
114. 

[0031] The call linkage represents a link or relationship between a procedure call 
(caller procedure) and a procedure that is called (callee procedure). In one 
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embodiment, the compiler 110 selects between a memory-based call linkage and a 
register-based call linkage. The processor 102 performs different operations to 
implement these two types of call linkages. In the memory-based call linkage, the 
parameters or arguments of a procedure or subroutine call are "passed in memory." 
Namely, the arguments are initially stored from the registers 128n to memory 106 and 
then loaded from memory 106 back to the registers 128n when the compiler 110 
performs a procedure call. In the register-based call linkage, the arguments of the 
procedure are copied to and from registers in the processor 102. A particular subset of 
registers 128n, called parameter registers, are used during a procedure call. One 
embodiment of a process to implement the memory-based call linkage is detailed in 
FIG. 9B and one embodiment of a process to implement the register-based call linkage 
is detailed in FIG. 9C. 

[0032] The implementation of the memory-based call linkage requires the processor 
102 to perform memory accesses for the storage and retrieval of parameter values. In 
contrast, the implementation of the register-based call linkage requires the processor 
102 to access and copy parameter values to local registers in the processor 102. 
Given the current state of the art, memory access is much slower than the speed of the 
processor 102. Accordingly, the register-based call linkage is much faster than the 
memory-based call linkage under ideal conditions. 

[0033] Although the compiler 1 10 in the above embodiment selected between a 
memory-based call linkage and a register-based call linkage, other embodiments may 
select among different classes of call linkages. Examples of such classes of call 
linkages include a "register stacks 11 call linkage, a "system" call linkage or "operating 
system" call linkage, and a "near versus far" call linkage. In the register stacks call 
linkage, a subset of registers 128 N is configured to operate as a stack. Frames or 
windows of registers are allocated on the stack for each procedure call or invocation 
and are popped from the stack on return from the procedure call. As such, the window 
may overlap during different procedure calls which allows the different caller and callee 
procedures to share information and eliminate the need to save and restore registers 
to and from memory 106 for each procedure call. 

[0034] The system call linkage defines a link or relationship between system routines 
instead of a link between caller and callee procedures in a given programming 
language. Variations of the system call linkage include assigning different linkages to 
invoke an executable object code 1 16 and assigning different linkages for procedures 
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written in different programming languages. In certain programming languages such 
as C or C++, a linkage may be specified with a #pragma statement, e.g., #pragma 
linkage (OS). 

[0035] In the near versus far call linkage, different types of branching linkages are 
used depending on the distance in memory 106 between the caller and callee 
procedures. If the caller and callee procedures are separated within a threshold 
distance in memory 106, then a simpler branching linkage is implemented. If the caller 
and callee procedures are separated further apart in memory 106, e.g., different 
segments of memory 106 or different modules 202i, 202 2 and 202 3 , then a more 
complex branching linkage is implemented. 

[0036] However, there are certain conditions where the implementation of the 
register-based call linkage may result in additional run time delays over the memory- 
based call linkage. For example, if the number of parameters in a procedure call 
exceeds the number of parameter registers, the processing of the additional 
parameters may cause additional processing delays. In such a situation, the 
processor 102 would preferably use the memory based call linkage to implement the 
procedure call. In one embodiment, the compiler 110 determines the call linkages to 
improve the overall runtime performance of the compiler program 110. 
[0037] In general, the routines executed to implement the embodiments of the 
invention, whether implemented as part of an operating system or a specific 
application, component, program, object, module or sequence of instructions, are in 
the compiler program 1 10, or the compiler 1 10. The compiler 110 typically comprises 
one or more instructions that are resident at various times in various memory and 
storage devices in the computer system 100. When read and executed by one or 
more processors 102 in the computer system 100, the compiler 110 causes that 
computer system 100 to perform the steps necessary to execute steps or elements 
embodying the various aspects of the invention. Moreover, while the invention has and 
hereinafter will be described in the context of fully functioning computers and computer 
systems, those skilled in the art will appreciate that the various embodiments of the 
invention are capable of being distributed as a program product in a variety of forms, 
and that the invention applies equally regardless of the particular type of signal bearing 
or computer readable media used to actually carry out the distribution. Examples of 
signal bearing or computer readable media include, but are not limited to, recordable 
type media such as volatile and nonvolatile memory devices, floppy and other 
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removable disks, hard disk drives, optical disks (e.g., CD-ROM, DVD, and the like), 
among others. 

[0038] In addition, various programs and devices described hereinafter may be 
identified based upon the application for which they are implemented in a specific 
embodiment of the invention. However, it should be appreciated that any particular 
program or device nomenclature that follows is used merely for convenience, and the 
invention is not limited to use solely in any specific application identified and/or implied 
by such nomenclature. For example, the computer system 100 is depicted in a stand- 
alone environment, but is also applicable in a networked environment, In particular, 
the call linkages may be selected to optimize the run-time of the object code 1 16 for 
any computer in a computer network. 

[0039] FIG. 2 depicts a conversion of a source code 1 1 4 to an object code 1 1 6 by a 
compiler program 110. The source code 1 14 comprises at least one source program 
that is generally written in a programming language interpreted and translated by the 
compiler 1 10. Specifically, the source code 114 may comprises of one or more 
modules illustratively 202i, 202 2 , 202 n (hereinafter 202 n ). Each module 202 n 
represents a portion of the source code 114. The modules 202 n also contain 
procedure or subroutines that are analyzed by the compiler 110. An exemplary source 
code 1 14 is further detailed with respect to FIG. 3. 

[0040] The compiler 1 1 0 analyzes the source code 1 1 4 to derive data structures 1 1 7. 
In the present invention, the data structures 117 are computer files that describe 
information pertaining to the source code 114. Specifically, the data structures 117 
contain information on the procedures, modules, parameters or arguments, and any 
link or relationship between caller and callee procedures. One embodiment of the data 
structures 1 17 is further described with respect to FIGS. 4A and 4B. 
[0041] The compiler 1 1 0 uses the data structures 1 1 7 to translate the source code 
1 1 4 into the object code 1 1 6. The object code 1 1 6 comprises one or more modules 
204i, 204 2 , 204 n (hereinafter 204 n ). The number of modules 204 n in the object 
code 1 16 is not necessarily the same as the number of modules 202 n in the source 
code 114. In one embodiment, the compiler 110 analyzes the data structures 1 17 to 
determine the optimal call linkages for the caller and callee procedures that are not 
inlined. Such non-inlined procedures are not replaced with the instructions contained 
in the procedure by the compiler 110. The optimization applies to linkages between 
caller and callee procedures in different modules 202 n of the source code. By having 
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the compiler 110 optimize the selection of call linkages in the source code 114, the 
present invention avoids the need for a user to manually indicate call linkages which is 
an error prone process. 

[0042] FIG. 3 depicts one representation of the source code 114. The source code 
1 14 is generally written in a high-level programming language. The source code 1 14 is 
only exemplary in nature and does not limit the scope of the present invention. The 
source code 114 comprises three modules, module A 202i, module B 202 2 and 
module C 202 3 , Each module 202i, 202 2 and 202 3 comprises procedures, i.e., 
procedure definitions, and procedure calls. For example, module A 202i comprises 
procedures calcsum and calcprd, module B 202 2 comprises procedures sum and prd, 
and module C 202 3 comprises procedure main. Additionally, module A 202i comprises 
four procedure calls including two procedure calls to the callee procedure sum and two 
procedure calls to the callee procedure prd. Module C 202 3 comprises two procedure 
calls to the callee procedures calcsum and calcprod. These procedure calls may 
comprise intermodule calls. Namely, a procedure call in one module may call a 
procedure in a different module. For example, a procedure call of procedure calcsum 
is in module C 202 3 which calls the procedure calcsum in module A 202i. 
[0043] A "call linkage" relates each procedure call to its callee procedure. The call 
linkage defines a relationship between the caller and callee procedures in a procedure 
call. Two types of call linkages include a memory-based call linkage and a register- 
based call linkage. These call linkages are implemented during runtime of an object 
code 116 that is generated from the source code 114. The memory based call linkage 
requires memory accesses to execute a procedure call in the object code 116. The 
register-based call linkage uses parameter registers in the processor 102 to implement 
the procedure call. Exemplary implementations of the memory based and register 
based call linkages are further described with respect to FIGS. 9B and 9C. 
[0044] Although the register based call linkage is ideally much faster, there are 
situations when the use of the register based call linkage may cause run time delays. 
For example, if the number of parameters in a procedure call is greater than the 
number of registers available, then using the register based call linkage may cause 
delays in processing the additional variables. The present invention provides an 
optimal selection of call linkages to minimize the overall run time of the object code 
116. 

[0045] FIG. 4A depicts one embodiment of a data structure 1 17 generated by the 
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compiler 110. One possible format of the data structure 1 17 is a database table 
subject to a database query. The data structure 1 1 7 comprises information for each 
procedure or procedure definition in the source code 1 14. Illustratively, the data 
structure 117 comprises a procedure identifier field 402, a module field 404, a 
procedure name field 406, an argument count field 408 and an argument descriptor 
field 410. The procedure identifier field 402 comprises an identifier of a particular 
procedure. The value of this field 402 may be unique within the source code 1 14. The 
module field 404 comprises the name of a module 202 n containing a particular 
procedure. The procedure name field 406 comprises the name of a particular 
procedure. The argument count field 408 comprises the number of arguments or 
parameters in a particular procedure. The argument descriptor field 410 comprises 
information on the types of arguments specified in the argument count field 408. 
[0046] FIG. 4B depicts another embodiment of a data structure 117 generated by the 
compiler. In contrast to FIG. 4A, the data structure 1 17 of FIG. 4B comprises 
information on the procedure calls or caller procedures. Illustratively, the data 
structure 117 comprises a call link identifier field 412, a caller procedure field 414, a 
callee procedure field 416, an argument count field 418 and an argument count field 
420. The call link identifier field 412 comprises an identifier for a particular procedure 
call. The field 412 may be unique for the source code 114. The caller procedure field 
420 comprises the identifier 402 of the procedure where the call was made from. If the 
procedure call is made outside any of the procedures, then either no value or a default 
value is assigned to the caller procedure field 420. The callee procedure field 416 
comprises the identifier 402 of the callee procedure. The argument count field 418 
comprises the number of arguments or parameters used in a particular procedure call. 
The argument descriptor field 420 comprises information on the types of arguments 
specified in the argument count field 418. 

[0047] FIG. 5 depicts a flow diagram of a method 500 for minimizing the run time of a 
generated object code by determining optimal call linkages in accordance to the 
present invention. In one embodiment, the method 500 is an Interprocedural Analysis 
(IPA) option of the compiler program 110. The method 500 starts at step 502 and 
proceeds to step 504, where data structures 1 17 are created for procedures in the 
source code 114. More specifically, step 504 extracts information about the 
procedures from the source code 1 14 and creates data structures 1 17 to include the 
extracted information. One embodiment of step 504 is further described with respect 
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to FIG. 6. The method 500 proceeds to step 506, where data structures are created 
for procedure calls in the source code 114. More specifically, step 506 extracts 
information about the procedure calls from the source code 1 1 4 and creates data 
structures 1 17 to include the extracted information. The data structures 1 17 in step 
504 contain different information than the data structures in step 506. As such, 
different files or different data structures may be created in steps 504 and 506. Step 
506 is further described with respect to FIG. 7. 

[0048] At step 508, the method 500 uses the data structures 1 17 to optimize the call 
linkages. More specifically, step 508 optimizes the call linkages by determining an 
appropriate call linkage for each procedure call, e.g., caller function and called 
procedure. . The call linkages are optimized to minimize the overall run time of the 
object code 116 generated from the source code 114. Step 508 is further described 
with respect to FIG. 8. The method 500 proceeds generate the object code 1 16 from 
the source code 1 14 at step 510 and ends at step 512. 

[0049] FIG. 6 depicts a flow diagram of a method for creating one embodiment of a 
data structure 117 used to optimize call linkages. The method 600 is embodied as 
step 504 described above with reference to FIG. 5. Specifically, the method 600 starts 
at step 602 and proceeds to step 604 where a data structure 1 17 is created for all 
procedures or procedure definitions in the source code 1 1 4. The method 600 
proceeds to step 606 where a module 202 n in the source code 1 14 is considered. At 
step 608, an identifier is assigned for each module 202 n . The method 600 proceeds to 
step 610 where a procedure in the module 202 n is considered. 
[0050] At step 612, an identifier is assigned for the procedure. The method 600 
proceeds to step 614 where information on the procedure is extracted. Such 
information may include, but is not limited to, the identifier of the module 202 n 
containing the procedure, the name of the procedure, the number of arguments or 
parameters in the procedure, and the types of arguments used in the procedure. The 
method 600 proceeds to update the data structure 1 1 7 with the extracted information 
at step 616 and returns to step 610 where the next procedure is considered. After all 
the procedures in a particular module 202 n are considered, the method 600 returns to 
step 606 where the next module 202 n is considered. After all the modules 202 n in the 
source code 1 14 are considered, the method 600 proceeds to end at step 618. 
[0051] FIG. 7 depicts a flow diagram of a method 700 for creating another 
embodiment of a data structure used to optimize call linkages. The method 700 is 
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embodied as step 506 described above with reference to FIG. 5. Specifically, the 
method 700 starts at step 702 and proceeds to step 704 where a data structure 1 17 is 
created for all procedure calls in the source code 114. At step 706, a procedure call in 
the source code 1 14 is considered. The method 700 proceeds to step 708 where a 
query determines whether the callee procedure in the procedure call is an inline 
procedure. If the callee procedure is an inline procedure, the method 600 returns to 
step 706 where the next procedure call is considered, if the procedure is not an inline 
procedure, the method 600 proceeds to step 710, where an identifier is assigned for 
the procedure call. The method 700 then proceeds to extract information on the 
procedure call from the source code 1 14 at step 712, update the data structure 117 
with the extracted procedure call information at step 714, and return to the next 
procedure call in the source code 1 14 at step 706. Once all the procedure calls are 
considered, the method 700 exits at step 714. 

[0052] FIG. 8 depicts a flow diagram of a method 800 for determining the optimal call 
linkages for each procedure call. The optimal call linkages will minimize the run time 
or execution time of a generated object code 1 1 6. The method 800 is embodied as 
step 508 described above with reference to FIG. 5. In one embodiment, the method 
800 determines whether to assign a memory based call linkage or a register based call 
linkage to each procedure call or subroutine call. However, the present invention is not 
limited to the criteria and call linkages depicted in FIG. 8. Namely, the present 
invention may use other criteria and call linkages depending on the hardware and 
operating system 106 used in the computer system 100. 

[0053] The method 800 starts at step 802 and proceeds to step 804 where each 
procedure call in the source code 1 14 is considered. The method 800 proceeds to 
step 806 where a query determines whether the procedure call is known. A procedure 
call may target an unknown callee procedure if the procedure call is made via a 
pointer, i.e., a procedure call through a procedure pointer, instead of a direct procedure 
call. In this situation, the callee procedure is considered unknown, since the value of 
the procedure pointer is not easily determined. If the procedure call is unknown, e.g., 
made to an unknown procedure, the method 800 proceeds to step 808 where a 
memory-based call linkage is assigned to the procedure call. One embodiment of 
implementing the memory-based call linkage is further described with reference to FIG. 
9A. After step 808, the method 800 then returns to step 804 where the next procedure 
call in the source code 1 14 is considered. 
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[0054] If the procedure call is known, the method 800 proceeds to step 810, where a 
query determines whether the parameters in the procedure call or calling procedure 
matches the parameters of the callee procedure. In one embodiment, several 
conditions must exist for a match to occur. First, the calling and callee procedures 
must have the same number of parameters. Second, each parameter in the calling 
and callee procedures must have the same type of parameter and the same length. If 
the match does not exist, then the method 800 proceeds to assign a memory based 
call linkage at step 812 and returns to step 804. 

[0055] If the match does exist, then the method 800 proceeds to step 814, where a 
query determines whether the number of parameters in the procedure exceeds the 
number of parameter registers in the processor 102. The parameter registers are a 
subset of the registers 128n in the processor 102 that are used to perform a procedure 
call of the generated object code 116. Namely, step 814 determines whether the 
number of allocated parameter registers is sufficient to store all the parameters used in 
a procedure call. If the number of parameter registers is insufficient, the method 800 
proceeds to assign a memory based call linkage at step 816 and returns to step 804. 
[0056] If the number of parameter registers is sufficient, the method 800 proceeds to 
step 818 where a query determines whether all types of parameters in the procedure 
can be passed in the registers 128 N . Certain types of data, e.g., pointers, which 
cannot be passed in a register 128 N . If all parameters in the procedure are not 
passable, the method 800 proceeds to assign a memory based call linkage at step 820 
and returns to step 804. If all parameters in the procedure are passable, the method 
800 proceeds to step 822 where a register based call linkage is assigned. One 
embodiment of implementing the register-based call linkage is further described with 
reference to FIG. 9B. As such, the register-based call linkage is assigned only after 
satisfying a preconfigured set of conditions, e.g., in steps 806, 810, 814 and 818. The 
method 800 then returns to step 804. After all the procedure calls in the source code 
1 14 are considered, the method exits at step 824. 

[0057] However, the method 800 is not limited to the conditions in steps 806, 810, 
814 and 818 of FIG. 8. Those skilled in the art would readily realize that a hybrid or 
partial call linkage implementation is also capable. For example, if the number of 
arguments in a procedure call exceeds the number of parameter registers in step 814, 
the method 800 may still implement a register-based call linkage by processing a 
number of arguments equivalent to the number of parameter registers. 
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[0058] FIG. 9A depicts an exemplary process to implement a procedure call using a 
memory-based style of call linkage. The implementation of the procedure call is 
performed during execution of the object code 116. In the memory-based style of call 
linkage, memory is initially allocated for parameters in the procedure call. The values 
of the parameters are stored from the registers 128n in the processor 102 to specific 
control blocks in the memory 106. When the callee procedure is executed, the values 
are loaded back to the registers 128n from the memory 106. 

[0059] FIG. 9B depicts an exemplary process to implement a procedure call using a 
register-based style of call linkage. In the register based call linkage, the values of the 
parameters are initially copied within the processor 102. Specifically, these values are 
copied from the registers 128n to a particular subset of registers 128 N called parameter 
registers. The parameter registers are registers 128n used for the specific purpose of 
implementing the procedure call. Some or even all of the registers 128n are parameter 
registers. In some cases, no copy is required from the registers 128n to the parameter 
registers. When the callee procedure is executed, the values are copied back to the 
registers 128n from the parameter registers. In contrast to the memory-based call 
linkage, the register-based call linkage is usually faster under ideal conditions, since no 
memory access is required for parameters. 

[0060] Although various embodiments which incorporate the teachings of the present 
invention have been shown and described in detail herein, those skilled in the art can 
readily devise many other varied embodiments that still incorporate these teachings. 
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